Confidence measures for keyword spotting using support vector machines

نویسندگان

  • Yassine Ben Ayed
  • Dominique Fohr
  • Jean Paul Haton
  • Gérard Chollet
چکیده

Support Vector machines (SVM) is a new and very promising classification technique developed from the theory of Structural Risk Minimisation [1]. In this paper, we propose an alternative out-of-vocabulary word detection method relying on confidence measures and support vector machines. Confidence measures are computed from phone level information provided by a Hidden Markov Model (HMM) based speech recognizer. We use three kinds of average techniques as arithmetic, geometric and harmonic averages to compute a confidence measure for each word. The acceptance/rejection decision of a word is based on the confidence feature vector which is processed by a SVM classifier. The performance of the proposed SVM classifier is compared with methods based on the averaging of confidence measures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition and Rejection Performance in Wordspotting Systems Using Support Vector Machines

Support Vector Machines (SVM) is one such machine learning technique that learns the decision surface through a process of discrimination and has a good generalization capacity [6]. SVMs have been proven to be successful classifiers on several classical pattern recogntion problems [9, 11]. In this paper, one of the first applications of Support Vector Machines (SVM) technique for the problem of...

متن کامل

Semi-Supervised Keyword Spotting in Arabic Speech Using Self-Training Ensembles

Arabic speech recognition suffers from the scarcity of properly labeled data. In this project, we introduce a pipeline that performs semi-supervised segmentation of audio then— after hand-labeling a small dataset—feeds labeled segments to a supervised learning framework to select, through many rounds of hyperparameter optimization, an ensemble of models to infer labels for a larger dataset; usi...

متن کامل

Keyword Spotting Using Normalization of Posterior Probability Confidence Measures

Keyword Spotting Using Normalization of Posterior Probability Confidence Measures by Rachna Vijay Vargiya Thesis Advisor: Marius C. Silaghi, Ph.D. Keyword spotting techniques deal with recognition of predefined vocabulary keywords from a voice stream. This research uses HMM based keyword spotting algorithms for this purpose. The three most important componenets of a keyword detection system are...

متن کامل

بهبود کارایی سیستم کاوشگر کلمات تلفنی با استفاده از نرمالیزاسیون امتیاز اطمینان مبتنی بر روش برنامه‌ریزی خطی

Conventional word spotting systems determine hypothesized keywords and their confidence score using a speech recognizer. Acceptance or rejection of these keywords is intended based on comparison of their scores with a specific threshold. It has been proved that confidence score prepared by recognizer is highly dependent on sub-word structure of each keyword. So comparing assigned scores to keyw...

متن کامل

Improving performance of a keyword spotting system by using a new confidence measure

This work describes a HMM-based keyword spotting system. In this system, keywords are modeled as concatenations of the corresponding phoneme models, consequently, no specific databases are needed to train the system. In addition no filler models are required, therefore small computational requirements are necessary. Two main stages define the whole system. The first stage is based on a previous...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003